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Abstract. We develop novel techniques using abstract operator 
theory to obtain asymptotic formulae for lattice counting problems 
on infinite- volume hyperbolic manifolds, with error terms which are 
uniform as the lattice moves through "congruence" subgroups. We 
give the following application to the theory of affine linear sieves. 
In the spirit of Fermat, consider the problem of primes in the sum 
of two squares, f(c,d) = c 2 + d 2 , but restrict (c, d) to the orbit 
O = (0,l)r, where T is an infinite-index non- elementary finitely- 
generated subgroup of SL(2, Z). Assume that the Reimann surface 
r\H has a cusp at infinity. We show that the set of values f(O) 
contains infinitely many integers having at most R prime factors 
for any R > 4/(6 — 8), where 9 > 1/2 is the spectral gap and S < 1 
is the Hausdorff dimension of the limit set of T. If S > 149/150, 
then we can take 9 = 5/6, giving R = 25. The limit of this method 
is R = 9 for S — 9 > 4/9. This is the same number of prime factors 
as attained in Brun's original attack on the twin prime conjecture. 
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1. Introduction 

Many of the most enticing problems in number theory amount to 
finding primes or almost-primes (numbers having few prime factors) in 
"thin" subsets of the integers. By thin, we mean that the number of 
elements in the given set of size not exceeding a height T is bounded by 
T s , where 5 is some constant less than 1. See the papers |Pu53| IChe73[ 
IIwa78[ IFI98| IHB01J which are landmarks in sieve theory, producing 
primes or products of at most two primes in thin sets. 

It is our main goal to exhibit such a set arising from the orbit of an 
affine linear group action and sift for elements having few prime factors. 
Our starting point is the fundamental work of Bourgain, Gamburd, and 
Sarnak [BGS06, BGS08, Sar07j on the so-called affine linear sieve: 

Theorem 1.1 ( |BGS06j ). Let T C 5X 2 (Z) be any non- elementary^ 
subgroup, let q G Z 2 be nonzero, and let O = q ■ T be a T-orbit. Let 
f : Z 2 — > Z be any polynomial. Then there exists an R < oo, depending 
on all of the above data, such that there are infinitely many points in 
the set f(0) having at most R prime factors. 

Notice that there are no congruence conditions, nor conditions on 
/ (such as irreducibility) - these are all factored into R, which is left 
completely unspecified. There are three main ingredients: 

(1) Number Theory - the combinatorial sieve, 

(2) Algebra - Strong Approximation and Goursat's Lemma, and 

(3) Combinatorics - counting by wordlength in T and extending the 
recently established expander property of Bourgain-Gamburd 
[BG07J (with an unspecified spectral gap) to square-free moduli. 

The lack of specificity of the spectral gap (and arbitrariness of the 
choice of orbit O and function /) induces a lack of specificity of the 
number R of prime factors. 

In this paper, we select a particular orbit O and function /, and give 
a precise bound for the number R of prime factors by replacing input 
(3) above with an archimedian count and using Gamburd's explicit 
5/6-th gap |Gam02j in place of the unspecified spectral gap. 

This amounts to a hyperbolic lattice point counting problem, requir- 
ing uniform error estimates as the lattice moves through "congruence" 
subgroups. In the interest of having a thin set, we are compelled to 



Recall that an elementary group T C SL^Z) has the property that any two 
elements g,h eT of infinite order have | tr ((7^*7 1 /1. 1 ) | = 2, see |Bea83| . These are 
the "abelian" groups, such as purely hyperbolic groups, or purely elliptic groups (a 
torus generated by a single elliptic clement). 
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work in infinite volume, where standard spectral methods (decomposi- 
tion into Maass forms and Eisenstein series) cannot be applied. There- 
fore we develop novel "soft" methods using only operator theory (the 
abstract spectral theorem; see e.g. |Hal63j) and spectral information 
from Patterson-Sullivan theory |Pat76t ISul84] and Lax-Phillips |LP82] 
to circumvent explicit knowledge of a spectral decomposition. 

Remark 1.2. Note that an archimedean count is also used in Theorem 



2 of [BGS06J; Corollary 1.8 in the present paper is closely related to 
this result. The difference is counting in a group versus counting in 
an orbit. The problem of counting in the group in infinite volume 



was solved in Lax-Phillips [LP82J. To count in an orbit, one faces the 



serious issue of a stabilizer, as we discuss below. 



To state our main theorem, we require some notation. Our function 
of choice will be the sum of two squares: 

f{c,d) :=c 2 + d 2 . (1.1) 

Let T C SL2(Z) be a non-elementary finitely-generated Fuchsian group. 
We will soon turn our attention exclusively to groups of the second kind 
- ones having infinite co-volume - but we do not make this restriction 
just yet. Denote by the set of elements of T which stabilize infinity: 

7er| 7 =f I *)). (1.2) 



1 , , 

Let 5 = S(T) < 1 denote the Hausdorff dimension of the limit set of T 



and 6 < 8 be the spectral gap (see {2.2). The index of T in SL 2 (Z) is 
finite if and only if 5 — 1. 

Let O be the orbit of bottom rows of T, 

O:=(0,l)r=|(c,d): ( * rf) Gr °°\ r }' (L3) 

and for a height T > 1 let 0(T) denote the set of orbital points not 
exceeding this height: 

0(T) := {(c,d) G O | c 2 + d 2 <T}. (1.4) 

For R > 1 let 0(T, R) be defined by 

0(T,R) := {(c,d) G 0(T) \ f(c,d) has at most R prime factors}. 

Recall the notation 



/ x 9 for g < / < 
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Theorem 1.3 (Main Theorem). Let T C 5X2 (Z) be a non-elementary 
finitely- generated Fuchsian group, let 5 be the Hausdorff dimension of 
its limit set, 9 the spectral gap, and 0(T) and 0(T, R) defined as above. 
Assume is nontrivial (then 5 > 1/2, |Bea68] ). As T — > oo ; 

(1) There exist constants Cq > and 77 > such that 

\O(T)\=c T 5 + 0{T & - n ). (1.5) 

(2) For any fixed R > 4/(5 - 9), 

\0{T,R)\^T & /\ogT. (1.6) 

Part (1) above tells us that the orbit is thin if and only if 5 < 1, 
and is the quintessence of our necessity to work in infinite co- volume. 
In |BGS06j . it is proved that if 5 > 1/2, then there always exists some 
spectral gap 9 < 5. Gamburd [Gam02j shows that if 5 > 5/6 then we 
can take 9 = 5/6. If on the other hand T is a finite co- volume group 
and moreover a congruence group, then Kim-Sarnak [KS03J allows 9 = 
1/2 + 7/64 « .609. The following corollary is immediate. 

Corollary 1.4. .4s T — > oo ; 

\0(T,R)\ >T 5 /logT 

for 

(i) R = 25 if 5 > 149/150 (setting 9 = 5/6), 

(ii) R = 11 ifT is a finite index congruence group (where 5 = 1 and 
9 = 39/64), and 

(iii) R = 9if5-9> 4/9. 

It is known that in dimension two, infinite co-volume groups exist 
with 5 arbitrarily close to 1 (e.g. |Gam02j ) so part (i) above is not 
vacuous; moreover the example of T with S arbitrarily close to 1 given 
in |Gam02] does in fact contain unipotent elements. Part (ii) is not 
particularly interesting, since one can vastly improve R for a finite 
index congruence group with classical techniques. We include it here 
only for comparison. It is not known whether part (iii) above is vacuous 
(even taking 5 = 1 requires 9 < 5/9 ~ .555 and is outside the reach of 
Kim-Sarnak), but R = 9 is the limit of our methods, and coincidentally 
is precisely the number of factors attained in Brun's original attack 
[Brul9j on the twin prime conjecture. 

Remark 1.5. Determining membership in O amounts to expressing 
an element in V as a word in the generators - not an easy task. We 
are finding numbers with few prime factors despite having extremely 
limited knowledge as to which numbers appear! 
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Remark 1.6. The choice of the function f(c, d) = c 2 + d 2 is cosmetic; 
our methods apply to an arbitrary polynomial /, and we plan to de- 
tail this generalization in a future publication. That said, our current 
choice of / is natural, not only historically (indeed the problem of 
finding primes in sums of two squares dates back to Fermat) but also 
because, as we shall see, it is everywhere unobstructed in the affine 
linear sieve (akin to looking for primes congruent 1 modulo q - there 



are no obstructions for any q). See Remark A. 2 



Remark 1.7. The requirement in the Main Theorem that be non- 
trivial is undesirable. Indeed the conclusions should hold without this 
assumption, and we are currently working to remove it by other meth- 
ods. See Remark [B. 21 

More importantly, unipotent elements furnish an affine injection into 
our orbit, enabling more classical sieve techniques. Precisely, if a, 7 G V 
and a ^ I fixes infinity, then /((0, l)7a ra ) is a quadratic polynomial in 
n. These are known |Iwa78] to contain infinitely many numbers with at 
most two factors! Moreover by varying 7, one can accrue a Zariski dense 
set of 2-almost primes, in the sense of [BGS06J. Therefore we state one 
more immediate corollary which cannot be deduced by "cheating" with 
unipotents: 

Corollary 1.8. Let T, 5 and 0(T,R) be as in the Main Theorem. 
Then for R = 1 we have the following upper bound for the number of 
primes in f : 

\0(T,1)\ <T 5 /logT, as T^ 00. 

This is off by a constant multiple from the expected asymptotic formula. 

This paper is organized as follows. In the next section we give back- 
ground material on Strong Approximation, the geometry and spectra 
of infinite co-volume groups, and the weighted linear /3-sieve. In §3 
we prove a certain Main Identity, which shows how to grow the lattice 
point count at time T from that at small times via the Laplace op- 
erator (this is the key to circumventing an explicit spectral theorem). 
In §4 we collect preliminary facts about infinite volume lattice point 
counts before proving the Main Theorem in §5. Some technical issues 
are reserved for the Appendices. 

Acknowledgements. I thank my advisors Dorian Goldfeld and Pe- 
ter Sarnak for their guidance, encouragement, and inspiration. I am 
grateful to the anonymous referees and Hee Oh for detailed comments 
and corrections to an earlier draft of this document. 
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2. Background Material 

2.1. Strong Approximation. Our first ingredients are algebraic in 
nature. We require the Strong Approximation Theorem of Matthews, 
Vaserstein, and Weisfeiler |MVW84j . stating that if L C SL n (Z) is 
Zariski dense in SL n (Z) then the projection of L on SL n (Z p ) is dense for 
all but finitely many primes p. Recall that the ring of p-adic integers Z p 
is the inverse limit of the finite rings Z/p fc Z. In particular, this means 
L is onto SL n (Z/pZ). Actually in dimension two this can be done by 
more elementary methods |DSV03j . 

We also require Goursat's Lemma which states the following. Let 
Gi, (?2 be groups, and let if be a subgroup of G\ x C7 2 such that the 
two projections pj : H — ► Gj, j — 1,2 are surjective. Let Nj be the 
kernel of pj. One can identify Ni as a normal subgroup of G2, and N 2 
as a normal subgroup of G\. Then the image of H in G1/N2 x G2/N1 is 
the graph of an isomorphism Gi/N 2 ~ G , 2 /A^ 1 . As the group PSL 2 (F) is 
simple for a finite field F satisfying |F| > 4, this implies that if p\ 7^ p 2 
are two primes and L C SL 2 (Z) projects onto both 5X 2 (Z/piZ) and 
SL 2 (Z/p 2 Z), then L projects onto SX 2 (Z/pip 2 Z). 

We follow [BGS06] in recording these two facts as follows: 

Theorem 2.1. Let Y C 5X 2 (Z) be nonelementary . Then there exists 
a number 03 which is the product of a finite set of "ramified" primes 
such that if q = q'q" is square-free with q'\*B and (q",*B) = 1 then the 
projection of T in 5X 2 (Z/gZ) is the product G q i x S , L 2 (Z/g // Z), where 
G g i is the projection ofT in SX^Z/g'Z). 

2.2. Infinite-volume geometry and spectral data. Let Y C SL 2 (Z) 
be a finitely-generated non-elementary Fuchsian group. Then Y acts on 
the Poincare upper half plane EI by fractional linear transformations. 
In dimension two, being finitely generated is equivalent to being geo- 
metrically finite, i.e. that the Riemann surface T = Y\W has finitely 
many bounding sides |Bea83j . 

As the action of Y on EI is discrete, there are no limit points in 
EI. There are however limit points in the boundary E = RU {°°}- 
The set of all limit points of Y is called the limit set A = A(T). It is a 
Cantor-like fractal and has some Hausdorff dimension 5 = 5(Y) e [0, 1]. 
A geometrically finite group has 5 < 1 if and only if T has infinite 
hyperbolic volume (i.e. Y has infinite index in SL 2 (Z)). 

If we label the generators of SL 2 (Z) by T : z 1— > z + 1 and 
S : z 1— > — 1/z, then a prototypical example of the type of group we 
have in mind is the Hecke group Y = (T 4 , S). This group has funda- 
mental domain T = {z G EI | \z\ > 1, |£He(z)| < 2}, whose vertical 
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Figure 1. A fundamental domain for T = (T 4 , S). 

strips touch the real line at a free boundary and clearly contribute in- 
finite hyperbolic volume. See Fig. [TJ We will assume throughout that 
T has a cusp at infinity, as in this example. 

As usual, T is equipped with a hyperbolic volume element dz = 

The positive-definite Laplace-Beltrami operator A = — y 2 (jj^z + j 

of J- acts in the space of C°° functions with compact support C^(T), 
and has a unique self-adjoint extension to an unbounded operator on 
L 2 (F). Denote the spectrum of A on L 2 (r\H) by Spec(r\H). 

The spectrum of A below 1/4 consists only of finitely- many point 
eigenvalues and the tempered spectrum contained in [1/4, oo) is purely 
continuous [LP82J. Notice that in finite volume 5 = 1 and the base 
eigenvalue is A = 0, corresponding to the constant function (f> = 
1 / y vol( J-) , scaled to have unit L 2 -norm. Neither does this make sense 
if vol(jF) = oo nor is any non-zero constant function square-integrable 
in this case. It follows from the work of Patterson |Pat76| ISul84j that 
5 > 1/2 if and only if 

s P ec(r\e) n [o, 1/4) ^ 0, 

in which case Ao = 6(1 — 5) is the base eigenvalue of A. This eigenvalue 
is isolated, has multiplicity one, and any associated eigenfunction is of 
constant sign on T] in particular we can choose it to be nonnegative. 
The role of the constant function (volume) is then played by the base 
eigenfunction <fio which Patterson determined explicitly as the integral 
of a Poisson kernel against the so-called Patterson-Sullivan measure /j,, 
supported on the limit set A: 

If T has a cusp (which we assume it does) then S > 1/2. 
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Let T(q) denote the principal "congruence" subgroup of T of level q, 

T(q) : = {7 G T : 7 = Z(modg)}. (2.2) 

This is of course still a Fuchsian group of the second kind, but has finite 
index in T (and therefore also has the same limit set and Hausdorff 
dimension - every Cauchy sequence in EI under the action of T has a 
corresponding sequence under T(q) with the same limit point). The 
inclusion T(q) C T induces the reverse inclusion 

Spec(r\H) c Spec(r(g)\H). 

In particular this means the base eigenfunction 0o is an "oldform" on 
T(q) (but must be rescaled to have unit L 2 -norm). 

Corresponding to any point eigenvalue A G Spec(r(g)\HI) is an eigen- 
function, G L 2 (r(g)\EI). It may be the case that <fi (which a priori 
is only r(g)-automorphic) is also automorphic with respect to V. In 
this case we we call an "oldform" and A is an "oldvalue". In the 
opposite case we call <fi a "newform" and A a "newvalue". Denote 
by Spec(r(g)\H) new the subset of Spec(r(g)\H) consisting of "new" 
eigenvalues. 

Definition 2.2. We say that Y has a spectral gap 6 G [1/2,5) if 
there exists 23 G N such that for q = q'q" square-free with g'|*B and 
(q", OS) = 1, we have 

Spec(r(g)\e)„ ew n (0, 0(1 - 9)) C Spec(r(g , )W) new . (2.3) 

Note that this definition of the "spectral gap" is not the conventional 
one (for which see, for example, |EMV07[ IKS08j ). 

Collecting the results in |BC07[ [BGS06J and their extension from 
prime to square- free of [Gam02] we have: 



Theorem 2.3 ( [Cam02j IBGDTj IBCS06] ) . Let V C SL 2 (Z) and 5 > 1/2 



be the Hausdorff measure of its limit set. 



(1) There exists a spectral gap 8 G [1/2,5) such that (2.3) holds 
with 05 = 1. 



(2) If 5 > 5/6 then (2.3) holds with 6 = 5/6 and the number 03 is 



precisely the one which appeared in Theorem 2.1 



The case V = SL^Z), 03 = 1 and 9 = 1/2 is the celebrated (and 
unsolved) Selberg 1/4-Conjecture |Sel65] . which in modern parlance is 
a consequence of the generalized Ramanujan Conjectures. 

We now record the abstract spectral theorem for unbounded self- 
adjoint operators as follows. 
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Theorem 2.4 (Abstract Spectral Theorem). There exists a spectral 
measure v, supported on Spec(r\H) and a unitary spectral operator 
~: L 2 (r\H) -> L 2 ((0, oo), dv) such that: 

(1) We have the Abstract ParsevaVs Identity: for 0i, 2 £ -^ 2 (r\EI), 

= ^) L2(Spcc( r\e)^) (2 ' 4) 

(2) T7ie spectral operator ~ is diagonal with respect to the Laplacian: 
for<pe L 2 (r\H) and A G Spec(r\H) 

A0(A) = A0(A). (2.5) 

(3) If \ is a point eigenvalue of multiplicity one with associated L? 
eigenfunction A of unit norm, then for any G L 2 (r\H) ; 

0(A) = (0,0a). (2.6) 

2.3. Combinatorial Sieve. Let A be a sequence of non-negative real 
numbers {a n } n of which all but finitely many are zero. For R > 1 let Zr 
denote the set of positive integers with at most R prime divisors. The 
main objective in sieve theory is to determine lower bound estimates 
for 

nez R 

given knowledge of how A is distributed along each of the arithmetic 
progressions O(modg) for square-free q. In the following setup of the 
sieve, there are many parameters. Their heuristic meaning is as follows. 

Let q > 1 be square-free and collect the elements of A whose index 
is divisible by q via 

A q := K G A | n = 0(g)}. (2.8) 

Of course Ai = A. The parameter T is the cut-off point after which 
all a n 's are zero. The parameter 25 G N is the product of a fixed finite 
set of "ramified" primes. We will decompose q into the Q3-part and 
the rest: q = q'q" with g'|Q3 and (q",V&) = 1. The parameter X is an 
approximation to |^4| := J2 n a n , and uj : N — > [0, 1] represents the local 
density at q. Thus u(q)X is an approximation to 

\Aq\ ^ ^ Q>n- 
n=0(q) 

At the 03-part, there is a lower order term, X q > such that 
r(q) := - u(q)(X + X q ,) 



10 



ALEX V. KONTOROVICH 



is an error term. The error is small on average: there is a sieving level 
Q such that the total contribution from the error terms up to Q is a 
power savings off the main term X. 

Precisely, we require the following conditions: 

(51) X x | ^4 1 := ^ n a n and 03 G N is a fixed natural number. There 
is a parameter T > 1 such that 

a n = if n > T. (2.9) 

(52) The function u : N — > [0, 1] satisfies uj(1) = 1 and u(q) < 1 for 
q > 1. Moreover, u is multiplicative away from 23. By this we 
mean that for any q square- free, write q = q'q" with q'\*B and 
(q", 23) = 1. Then u;(g) = u(q f ) Y\ pW > w(p). 

(53) There exists a fixed constant < oo such that for 2 < v < z, 
we have the following local density bound: 

n- ? |.(i— &>))-'< (is) (!+*)• 

(54) For any divisor g'|Q3 let A^/ satisfy A^/ C^ 1 11 for some r/ > 0. 

(55) Define r(g) := \A q \ — u(q) (X + X q i) and assume that for some 
parameter Q > 1 and e > 0, 

^|r(g)|«^ 1 - £ . 

The following Theorem is a simple consequence of Theorem 3.7 on 
page 63 in |Iwa96j . We derive it from the original in Appendix |A| 

Theorem 2.5. Let A be as described above. Then 

J2 OnXXY[(l-u(p)), (2.10) 



neZ R p<Q 



for any R satisfying 



R > 21ogT/logQ. (2.11) 



In our application, Goursat's Lemma demonstrates the multiplica- 
tivity of the local density lo in (q2J), and Strong Approximation allows 
us to compute to explicitly and verify (S^]). The finite collection of 
primes comprising 03 which may have exceptional eigenvalues in The- 
2.3 are directly responsible for the extra terms X q > in (^4]) but 



orem 



their contribution is harmless to the inclusion-exclusion of the sieve. 
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3. The Main Identity 

Let T C SL.2(Z). Recall the classical fact that for 7 = ( c d ) 
we have 



\cz + d\ 2 ' 

Fix T > 1 and define the characteristic function 

fl if , mW >l/T 

I (J otherwise, 

so that 

x (71) - I 1 if ° 2 + rf2 <T 

1 1 otherwise. 

Let Too C T be the maximal subgroup stabilizing infinity; clearly \t is 
Too-invariant. 

We average \t over the group Y 

F T (z):= XtM, (3.2) 

7eroo\r 



so that we can recover the lattice point count (1.4) via 

F T (i) = \0(T)\. 

Clearly Ft is T-invariant, so is well defined as a function on r\H. 

Lemma 3.1. Ft G L 2 (r\M) if and only ifT^ is nontrivial. 

Proof. If Too is nontrivial, let N be the finite interval corresponding to 
the width of the cusp at infinity. Otherwise set iV = K. By unfolding 
we have 



|F T || 2 = I F T {z)F T {z)dz = [ V X T(jz)F T (z)dz 

Jr\m y r \H 7grooVr 

= / X T(z)F T (z)dz = [ [ F T (z)dx- 



dy 
,2 ' 



ir^xm Ji/T Jn V 

For Jva(z) sufficiently large, only the term 7 = I contributes to the 
sum (3.2), and so Ft(z) = 1. It is now clear that the above integral 
converges if and only if iV 7^ M. □ 

Instead of accessing Ft directly, we will seek an identity which "grows" 
the count at time T from the count at time T = 1 (and at some other 
time, T = b). Let <fi G L 2 (r\EI) be an eigenfunction of the hyper- 
bolic Laplace-Beltrami operator A = —y 2 (j^z + Jp-j with eigenvalue 
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A = s(l — s) < 1/4. By unfolding the following inner product and using 
the fact that the constant (or any) Fourier coefficient of satisfies the 
same differential equation as itself, we have 



(F T ,<t>) = / F T (z)<P{z)dz = / X T(z)<Kz)dz 
'r\H Jt^m 

i/T \Jn j y Ji/t y 

= A+T' + BtT 1 -', 

where a, (3, A^, and Bs are some constants depending on the eigen- 
function 0. (As Fx is like a truncated Eisenstein series, this calculation 
is just the incomplete Mellin transform of the constant term of 0.) 

Following the methodology of Selberg |Sel56] . we seek an identity 
which depends only on the eigenvalue A or s but not on the eigenfunc- 
tion 0. Reformulate the above identity as a dot product of vectors 

(F T , 0) = A S T S + B^T l - s = (T s , T 1 ' 3 ) ( ^ ) (3.3) 



and write it at time T = 1 and some other time T = b > 1: 

(F 1; 0) = A^ + Bs, 
(F b ,<f>) = A<pb s + B^- S , 

or 

(FiA) \ (I 1 \( A 4 
(F b ,<f>) ) - ^ V )\B 4 

Notice that the matrix on the right hand side only depends on the 
eigenvalue s. We multiply both sides by the inverse matrix 

As \ _ 1 ( b 1 -* -1 \ / (F 1; 0) 



B<p J b 1 -* -b s \ -b s 1 J \ (Fi 



b- 



and insert this into (3.3): 



is, 1 ( b l - s (F l5 0) 



^ = ^ T >¥^b*\-b* 1 

= K T {s)(F l ^) + L T {s)(F b ^), (3.4) 

where 

rpsUi—S 'T'l — SLS 
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are functions which only depend on the eigenvalue, A = s(l — s). 
Notice that for T > 1, s G (1/2, 1], and b fixed 

K T (s),L T (s) <T s . (3.6) 

For s = \ + it we have 



K ( \ ^i/ 2 sin(t log T/ logo) 
sin(t logo) 

'T\ 1/2 sin(tlogT) 



b / sin(tlogo) 



Choosing 6 = b(T) such that G Z ensures that the functions above 
are 

<T 1/2 logT. 

For example, we can take 

6=exp (Ti^r) £(1,e1, (37) 

where [•] is the ceiling function, returning the smallest integer not less 
than its argument. In particular, 

b < 3. 

As A = s(l — s), we abuse notation by writing 

K T {\) and L T (X) 

in place of 

K T (s) and L T (s). 

This should cause no confusion. Just as one can exponentiate a matrix, 
one can define a function of a differential operator, which is itself a 
differential operator. So as <fi is an eigenfunction satisfying A0 = A0, 
we have 

K T (A)0 = K T {\)<f>, 

where the left hand side acts by differentiation and the right hand side 
is multiplication by a function of the eigenvalue. The same holds for 

Since the Laplacian A is self-adjoint, we have from (3.4) 

(F T ,<t>) = K T (X)(F 1 ,<f>)+L T {X)(F b ,(f>) 

= (F 1 ,K T (A)0) + (F fe ,L r (A)0) 

= (F 1 ,K T (A)cj ) ) + {F b ,L T (A) ( j ) ) 

= (K T (A)F 1 ,(f>) + (L T (A)F h ,(f>) 

= (K T (A)F 1 + L T (A)F bl 4>) . 
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This identity holds for any eigenfunction in the point spectrum, and 
so it should be the case that 

F T (z) = K T (A)F 1 (z) + L T (A)F b (z) (3.8) 

holds in general. Notice that the T dependence has been entirely re- 
moved from F on the right hand side and only appears in the spectral 
operators Kt and Ly! It should now be clear what we mean by gath- 
ering information about F at time T using only the Laplacian and F 
at small times. We reserve the rest of this section for the proof of this 
Main Identity. 

Theorem 3.2 (Main Identity). For fixed T > 1 let Ft be defined by 
(3.2). Then there exists a number b < 3 and functions Kt and Lt 
satisfying: 

L T {\) < K T {\) < T s 
for A = s(l — s) and s G (1/2, 1], and 

L T (A) <# T (A) <T 1 / 2 logT 



for A > 1/4, such that (3.8) holds for almost every z. 



The argument above (3.8) proves this identity along the point spec- 
trum. If we had an explicit spectral theorem in infinite volume, we 
would just need to carry out similar computations to prove this iden- 
tity on the continuous spectrum (for a finite co-volume group, this is 
easily achieved via the Eisenstein series). Instead, we will use ideas 
from "almost" eigenfunctions and perturbation theory for the proof, 
which will occupy the remainder of this section. 

We begin with a technical lemma. Note that from now on, is not 
assumed to be an eigenfunction of A, only that G L 2 (r\H). 

Lemma 3.3. For any G L 2 (r\H) and any A = s(l-s) > 0, A ^ 1/4 
there exist constants A and B such that 

(F T ,<f>) = AT S + BT l - s + A , r ,r (||(A - A)0||) . (3.9) 

If\ = 1/4 then 

(F T , <f>)=A T 1 ' 2 + B T l > 2 log T + A , T , r ( || ( A - A)0|| ) . 

This is an explicit computation, applying the method of variation 
of parameters and estimating the inhomogeneous component (which 
relies on being non-trivial). We present the details in Appendix [Bj 

Remark 3.4. One should think of as an "almost" eigenfunction with 
eigenvalue A. Then the error term in (3.9) should be small. This is a 
heuristic only; the argument applies for general and A, and we will 
soon see that such an application is necessary. 
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The following proposition says that the difference of the right and left 
hand sides of (3.9) has no correlation with any almost eigenfunction. 

Proposition 3.5. Let T, b, and Ft be as above and set 

G T = F T - K T (A)F 1 - L T (A)F b . (3.10) 

Then for any G L 2 (r\H) and any A > we have 

(G t ,0><a,t ||(A-A)0||. (3.11) 

Proof. Fix an arbitrary G L 2 (r\H) and A = s(l — s) > 0, and let Gt 
be defined by (3.10). Assume A ^ 1/4 (the computation in the the case 
of A = \ is similar). Consider the following trivial identity obtained by 
adding and subtracting identical terms: 

(G T ,0> = (F^^-iA^ + B^T 1 - 3 ) 

- ((K T (A)F 1 , 0) - K T {\) (F u 0» (3. 
-«L r (A)F 6 ,0>-L T (A) (F h ,<j>)) 

-(K T (X) (F 1 , ( p)-K T (\)(A 4> + B 4> )) 

- (L T (X) (F b , 0) - L T (X)(A^b s + B^- s )) 

+ (A^T S + B<pT l ~ s ) - K T (X)(A$ + B+) - L T (X)(A^b s + B^b 1 ' 

The bottom row is zero by construction of and Lt in (13.51). The 



top row is <a,t || (A — X)(j)\\ by Lemma 3.3 the fourth and fifth 

rows. It remains to understand the second and third_rows. 

Let denote the spectral transform of and let Ft be the spectral 
transform of Ft- As A is fixed, let A' be in Spec(r\H). By the Mean 
Value Theorem and Cauchy-Schwarz, the second row is 



fl3T2p = (K T (A)F l , 0) - K T (X) (F 1} 0) 
' K T -F\,(j))— K T {X) (K, 



= [ (Kt(X') - ir T (A))^(A')0(A / )^(A / ) 

JSpec(r\H) 

«a,t / (A' - A)0(A')^(A')^(A / ) 

JS P ec(r\e) 

« ||(A-A)0||||F 1 ||. 
The calculation for the third row is identical and we are done. 

□ 



Finally, we show that a function which is uncorrelated to any almost 
eigenfunction is zero. 
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Proof of the Main Identity. We aim to show that Gt defined in (3.10) 
vanishes almost everywhere. 

Since (3.11) holds for any 0, we are free to choose our 0. Equiva- 
lently we may choose its spectral transform 0, so we make the following 
construction. Fix e > and fix an arbitrary A > 0. Let 



0(A') := 



G T (X') if A' e (A - e, A + e) 
otherwise, 



(3.13) 



where Gt is the spectral transform of Gt- Inserting (3.13) into Ab- 
stract Parseval's Theorem (2.4) we have: 

(G T ,0> = 



G T ,' 

A+e 
A-e 



G T (A')0(A / )^(A / ) 



Spec(r\e) 
G T (X')\ 2 du(X'), 



(3.14) 



and 



ll(A-A)0|| 



Spec(r\H) 
A+e 



2 \ V2 

(A'-A)0(A / ) di/(A') 



lA'-AI 



< e 



A-e 

A+e 2 

Gt(A') du(X') 



A-e 



G T (X') dv(X') 

1/2 



1/2 



(3.15) 



Inserting (3.14) and (3.15) into ( |3.11[ ), we have: 

^A+e 

\G T (X')\ 2 du(X') < a ,t£ 



A-e 



A+e X 1/2 

\G T {X')\ 2 dv{X') 



A-e 



If the left side is zero, we are done. If not, we have for an arbitrary A: 

"A+e 

i/M2j..nA „ 2 



\G T (X')\ 2 du(X') < a ,t e- 



A-e 



Let /(A) = f x , <x \GT(X')\ 2 dis(X'). Then / is everywhere continuously 
differentiable with 

-A+e 



f'(X) = lim 

e^O 



/(A + e) - /(A-e) 
2e 



lim — 

e^o 2e 



\G T (X')\ 2 dv(X') = 0. 



A-e 



So as /' = uniformly and /(0) = 0, we have shown that / = 0, and 
Gt = a.e. 

This concludes the proof of the Main Identity. □ 
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4. Preliminaries 

4.1. Sums over wt- Once again let T C SL2(Z) and let C T be 
the stabilizer of infinity. Fix e > and let 

^ = ^ £ e L 2 (r M \i) (4.1) 

be an e-approximation to the identity about zq = i. By this we ask 
that ip 

(1) be smooth, nonnegative, 

(2) have total mass one f r ^ m ip = 1, and 

(3) be supported in a small neighborhood about z = i (a ball of 
radius e/10 will suffice). 

Recall the characteristic function \t from (3.1). Define the function 
w T = w Tj£ : T^yr -> [0, l] by 



wr{l) ■= / XT{z)^ £ {lz)dz. (4.2) 
For 7 G T having bottom row (c, d) it is easy to see that 



, ( . 1 if c 2 + d 2 <T/(l + e) 
^> = < ifeW>T/(l- £ ) (4 ' 3) 



We first prove the following lemma. Let H be a subgroup of T having 
finite index in T. Then both groups S and T have the same limit set, 
Hausdorff dimension 5 and base eigenvalue Ao = 5(1 — 5). Assume 
further that H contains Too. 

Lemma 4.1. Let r ro C H C T with [T : H] < oo and fre defined 
by (4.2). Lei Ai = Si(l — si) &e i/ie /irst eigenvalue above the base in 
Spec(C7\HI), and assume Ai < 1/4. Then for any fixed 7 G T, there 
exists a constant cr, e > depending only on Y and e such that 

(4.4) 

«eroc\H 

where the implied constant depends on V but not 5, 7 or e. Moreover, 



c r , e = c r (l + 0(£)), (4.5) 

where cr > does not depend on e. 

If Ai > 1/4, replace the error term in (4.4) 0(|T 1//2 log T). 

Proof. Throughout we will suppress dependence on e until it is conve- 
nient. 
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Let be the function defined on S\H which is the average of the 
characteristic function \t over the group S: 



F~(z) 



Similarly average if) over S to get 



(4.6) 



(4.7) 



which is an e-approximation to the identity about zq = i in L 2 (S\H). 
Notice that ip Zo {^~ l z) = tp yzo (z), where the latter function is an ap- 
proximation to the identity about 720- 

Replace 7 in ( 4.4[ ) by 7 _1 for convenience. Input the definition (4.2) 
into the left hand side of (4.4) and repeatedly unfold and refold the 
integrals: 



E 
E 



Xt (2) ^o(^7 



roo\H 



Xt(^)^o(^)^ 



roo\H 



/ xAz) 52 ^o(^) dz 



XT{z)^%Az)dz 



r oc \e 

E 



B\I 



Ff(z)V* Z0 (z)dz 

S\H 



(4.t 



where the inner product above (•,•)= is with respect to the Hilbert 
space L 2 (S\M). The above exchanges of summation and integration 
are justified since everything in sight is nonnegative and convergent - 



the sum on the left hand side of (4.8) has finitely many terms by (4.3). 
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By Abstract Parseval's Theorem (2.4), 



70 /Spec(S\H) 

F|(A )^ (A ) + 



(4.9) 

f?(A)§* (A)di*(A), 



Spec(~\H)-{A } 



where z/= is the spectral measure on Spec(H\HI) (see Theorem 2.4). By 
(2.6) and multiplicity one of the base eigenvalue A = 5(1 — 5) we have 



if(Ao) 



(4.10) 
(4.11) 



where 0q is the Patterson-Sullivan base eigenfunction in L 2 (S\H), nor- 
malized to have unit norm. Recall this function is real and nonnegative. 
It is elementary to verify that 



1 



(4.12) 



Unfolding (4.10) and inserting (4.12), we have 



if(Ao) 



( F T^o) 3 



\H 

roo\E 
1 



XT{z)(f)t{z)dz 



l/T 



0q (z)dx 



N 



dy 

yZ 



(4.13) 



where iV is a finite interval corresponding to the width of the parabolic 
at infinity in r\H. The key step here is that we have assumed Too C 
5 C T, so 5 contains the same maximal unipotent subgroup as V. Since 
Too is assumed to be nontrivial, the Hausdorff dimension of the limit 
set of T satisfies 

5 > 1/2. 

Therefore 0o is an eigenfunction of the Laplacian A with eigenvalue 
Ao = 5(1 — 5), the constant term in its Fourier expansion inherits the 
same differential equation. Therefore the inner integral in (4.13) is 
ay 5 + fiy 1 ' 5 , where a and (3 are some periods (constants depending on 
0o)- Inserting this into (4.13) and computing the elementary integral, 
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r^x & 
T + 



1-5 



■T 5 + 0(T 



l/2\ 



(4.14) 



where c' = > since 0o is nonnegative. Note that d only depends 
on T and not on H or e (which has been suppressed until now). 
Returning to (4.11), use (4.12) to define cr, e by 

[F7S[ = Tffnj ^ = [f^S] * H , (4-15) 



as 0o is 2-invariant. Using (4.15), we see that the main term in (4.9) 
coincides with the corresponding term in (4.4). 

For more precise information on cr, e , in particular its dependence on 

(4.16) 



e, use the Mean Value Theorem and (4.12) to get 



Js\m 



+ 



H\H 



0o (7^o) + sup o (w)-£ 

yMJG-B e (72o) 

$(*b) + 0(e), 



(4.17) 



since 0q is T-automorphic and J = 1. Thus (4.17), together with 
(4.15), verifies (4.5), where explicitly, 



c r = c'0o(i) 



Returning to the rest of the spectrum in (4.9), we apply the Main 
Identity (3.8) (valid for arbitrary T, in particular for T = S) to the 
error term: 



Err 



F#(A)** (A)^h(A) 



(4.18) 



Spcc(H\H)-A 



K T (A)Ff(\)^l(\)dv~(\) 



Spcc(H\H)-A 



+ 



L T (A)F^X)^l(\)du s (\), 



Spec(H\H)-A 
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where b < 3. Assume Ai < 1/4. By ( pT5| ), K T {A)F?{X) = K T (\)F~(X), 
so together with Cauchy-Schwarz and the bound (3.6) we have 



K T (A)Ff(X)^{X)d^(X)^T^\\Ff\\ s \\^f\\ Bi 

Spec(H\e)-A 

(4.19) 

where || • ||= is the norm on L 2 (S\H). Clearly the inclusion H C T and 
positivity of xt implies the pointwise bound 

*f(*)= E Xt(^)< E Xt(t*) = *i 
56roo\H 7eroo\r 

Applying the pointwise bound directly gives 



I Ft ||s ^ || -^*T 1 1 H 



vlrTsJll^llr, 



but this is not good enough for us - we will lose information in the 
sieve! Instead, we can exploit the positivity of F£ to unfold the L 2 
norm with respect to one copy of , apply the pointwise bound to the 
other copy, and refold again: 



\H 



E Xt^z) | F$(z)dz 
v £eroo\s 



XT{z)Ff{z)dz 



< 



r 

F r M 2 



■Tiir, (4-20) 
this time losing no information! Note that we have again used crucially 
the fact that cBcT. 

As tyf is an ^-approximation to the identity on a two-dimensional 
space, we can choose it so that 

II* £ S ||h«-, (4.21) 



where the implied constant is independent of H. Combining (4.20) with 



(4.21), inserting into (4.19), and carrying out the same computation 



with Lt replacing Kt, we have 



1. 



p~18] ) < -T Sl ||F 3 r || r < ^T Sl , 

since b < 3. The case Ax > 1/4 is similar. 
This completes the proof of the Lemma. 



□ 
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The method of proof allows a much more general statement. Instead 
of pulling out only the contribution from the base eigenvalue Aq in 



(4.9), we can take more terms. Let 9 G (1/2,5) be a fixed constant, 



and denote the eigenvalues in Spec(S\EI) below 9(1 — 9) by 

5(1 - 5) = A < Ai < A 2 < • • • < A M < 9(1 - 9). (4.22) 

Corresponding to each point eigenvalue Xj is a normalized L 2 eigen- 
function, <\>j. It may be the case that <pj (which a priori is only S- 
automorphic) is also automorphic with respect to some group Tj which 
satisfies H C Tj C T. Then <fij is an "oldform" on S and <f)j G L 2 (Fj\EI). 
This means Xj — Sj(l — Sj) G Spec(Fj\EI) and the same analysis as 
above gives 

F|(A J )§ s (A J ) = ^^ + 0(T 1 /2), 

where Cj depends on Tj but not on S. After extracting these lower 
order terms, the remaining error is simply O(^T ). Suppressing the 
precise dependence on e (which will be fixed for the remainder of this 
section), we have proved 

Lemma 4.2. Let C H C T with [T : H] < oo and wt be defined 
by ( 4.2[ ). Fix 9 E (1/2,5) and let the eigenvalues in Spec(S\H) below 



9(1 — 9) be denoted as in (4.22). For each j = 1, . . . , M let Tj denote 
a group satisfying E C V j C T such that Xj G Spec(rj\H). 

Then for any fixed 7 G T, there exists a constant cr > depending 
only on T and e, and constants Cj depending on Tj and e, such that 

4.2. Sums over a n . Recall the notation 7(7) = c 2 + d 2 for a matrix 
7 G T having bottom row (c, d). For n > 1 let 

a n (T):= w T(l) ( 4 - 24 ) 

76roo\r 

be a smoothed count for the number of elements in our orbit having 
height bounded by T and /-value exactly equal to n. 

Recall that r M C V is the group which stabilizes ^(7) = c 2 + d 2 in 



the sense that for 7' G T^, we have f( n f' r y) = /(7). Recall from (2.2) 
the principal "congruence" subgroup of level q 

T(q) ;={ 7G r: 7 = /(g)}. 
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This group stabilizes all 7 modg, i.e. if 7' G F(q) then 7*7 = 7(g). 
Similarly, let 



ri(g) := {7 e T : 7 



1 



(mod g)} 



(4.25) 



be the subgroup of T which stabilizes /(modg), i.e. if g G r\(g) then 
/(^7) = f( r y)(q). The inclusions Too C r\(g) and T(g) C Ti(q) C T are 
immediate. In particular, Spec(ri(g)\H) C Spec(r(g)\HI) and Ti(q) 
inherits the spectral gap properties of T(q). 
Fix q > 1 square-free and consider 

J2 a n( T )- ( 4 -26) 

n=0(q) 



Insert (4.24) into (4.26) and decompose 7 G into 7 = £71 with 

e G r^r^g) and 7l G rx(g)\r: 

nsEO(g) 



76roo\r 

/(7)=0(?) 



7i6ri(g)\r ^erooVr^,) 

/« T1 )=0(g) 



(4.27) 



/(7l)s0(«) 



since /modg is invariant under I\(g). Apply Lemma 4.2 with 



Ti(q) to the inner sum in (4.27) to prove 



Proposition 4.3. LetT > 1, q > 1 be square-free and a n (T) be defined 



by (|4.24|). Let 9 be the spectral gap of T and let the eigenvalues in 

be denoted by 

< \ M ( q) (q)< 0(1-0) ■ 



Spec(rT(g)\H) below 0(1 - 

5(1 - 5) = A < Ax(g) < A 2 (g) < 

For each j = 1, . . . , M(q) let qj denote a divisor qj\q such that 

A,(g) = Sj (q)(l - Sj (q)) G Spec^^g^W). 

Then there exists a constant cr > depending on T and e, and 
constants Cj depending on qj, such that 



n=0(q) 



c T T s 



M(q) 



+ E 



[r:Ii(g)] ^ : I^g)] 



+ 



where 

1 a 



|{(c,d)(modg) G | c 2 + d 2 = 0(g)}| 



E 




7ier 1 ( 9 )\r 

/(7l) = 0(9) 
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5. Proof of the Main Theorem 



In this section we prove Theorem 1.3 



5.1. Part (1). We begin by proving ( |1.5[ ). Let 

H £ (T) :=$> n (T), 



with a n (T) defined in (4.24) and implicitly dependent on e via wt, £ in 
fl4~2j ). Recall the definition of 0(T) from p~S] ). By g3| we have 



\0(T/(l + e))\<H e (T)<\0(T/(l-e))\., 
or equivalently, 

H e (T(l-e)) < \0{T)\<H e {T(l + e)). 
Apply Lemma 4.1 with H = T and 7 = J: 



H £ (T) = (cr + 0( £ ))T 5 + 0(-T Si ; 



(5.1) 



(5.2; 



where Ai = si(l— si) is the first eigenvalue above the base in Spec(r\H), 
satisfying 5 > S\. Combining (5.1) with (5.2) we elementarily arrive at 
(1.5) by an appropriate choice of e. 



5.2. Part (2) . Our only remaining task is to verify all of the conditions 
necessary to apply a combinatorial sieve to A = {a n (T)} (see £2.3). By 



(4.3), the sequence a n (T) defined by (4.24) satisfies (2.9) with T(l + e) 
replacing T. Throughout the rest of the section, e will be a fixed small 



constant. Anyway since the bound on R in (2.11) depends on logT, 
this difference is irrelevant. 

Let 9 G [1/2,5) be a spectral gap for V. There are at most finitely 
many primes for which Strong Approximation fails, and also finitely 
many primes at which the corresponding spectrum fails to have a 9- 
gap. Let 23 be the product of the primes in these finite "ramified" 
sets. 

For ease of exposition, assume first that q > 1 is square-free and 
relatively prime to 03. Then the projection 



SL 2 (Z/gZ) is onto, 



(5.3) 



and 



Spec(r(g)\H) n (0, 9{l - 9)) = Spec(r\H) n (0, 9{l - 9)). (5.4) 
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Apply Proposition 4.3 to (2.8), and infer from (5.4) that we can take 
qj = 1 for all j: 



i a 



where all c/s only depend on F (and e, which is fixed) but not on q. 
Then \A q \ = u(q)X + r(q), with 



X 



[r:r x ( g )]' 



and 



r(q) = 0{\O q \T e ). 



(5.5) 



Trivially uj(1) = 1. By (4.29) and (5.3) it is clear that oo is multiplica- 
tive, and for q = p a prime we can compute u){p) exactly. 

As p is unfamified, \O p \ counts the number of (c, d)modp with 
(c, <f) 7^ (0,0) and c 2 + d 2 = 0(p). The last equation is equivalent 
to c 2 = —d 2 (p), which has no solutions if —1 is not a square mod p, 
that is, when p = 3(4). In the opposite case, this cardinality is eas- 
ily computed by hand. Similarly, [r : Ti(p)] counts the number of 
(c, d) modp with (c, d) ^ (0, 0). Thus we have 



{1 if p = 2 

2(p_i) ifp = l(4) 

ifp = 3(4), 
[r:^)] = p 2 -l, 



(5.6) 



and so 



u(p) 



\Q P \ 

[r : r x ( P )] 



if p = 2 
if p = 1(4) 
if p = 3(4). 

Then the sieve condition (S|2|) is obvious, and (£|3|) follows from 

-i 




n 

p=l(4) 



k log z, 



a classical exercise (see e.g. |Lan53j ). Inserting (5.6) into (5.5) gives 

r(q)=0(qT e ), (5.7) 
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51) requires 



J2\r(q)\ =0{Q 2 T e ) <T 5 /( 1+e \ 

q<Q 

for any e > 0. This is satisfied for 

q = T w+?). (5.8) 
Then inputting (5.8) into (2.11), together with (2.10), gives (1.6) for 

2 log T/ log Q = 4/(5-0), 

since e > was arbitrary. 

This completes the analysis of the affine linear sieve in the case q is 
"unramified" . 

5.3. Ramified places. Let q = q'q" > 1 be square-free, with g'|03 and 
(q", 03) = 1. Notice that 03 being the product of a finite number of 
primes means there are only finitely many possible values of q'. 
Then by Theorem 2.1 the projection of T in SL^Z/gZ) is 

Gg> x SL 2 (Z/g"Z), (5.9) 

where G q > is the projection of Y in SL 2 (Z/g'Z). 

By Theorem |273j if A < 6(1-6) is in Spec(r(g)\H) but not in 
Spec(r\H), then A G Spec(r(g')\H). 

Then applying Proposition 4.3 and using the fact that 

[T:E] = [T:T J }[T r .E} 
for H C Tj C T, we have 

\A,\ = \o,\ ( [r .r lM] (* + *<) + °'( T ^) • 

E 



where 



X = c r T a 



A J -eSpec(r\H)-{A } 



and 



X q , = [T : l\(g')] 



E 



Aj6Spec(r 1 ( 9 ')\H)neu; 

Here c/s depend on V and not on q, while c*'s depend on q' but not 
on q". Clearly X q , < X 1 ^ for some 77 > 0. By = \O q ,\\O q „\ 

and [ri(g") : I\(g)] = [r : Ti(q')], which is independent of q" . 
Therefore we have the expression 

\A q \=uo(q)(X + X q ,) + r(q), 
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where 

and r(q) satisfies ( |5.7 ). The rest of the analysis follows as before, 
completing the proof of the Main Theorem. 



Appendix A. Proof of Theorem 12.51 



In this appendix we derive Theorem |2 . 5| from the more standard sieve 
setting. See |Iwa961 IIK04] . 

As before, let A = {a n } be our sequence of nonnegative numbers 
with a n = for n exceeding a parameter T, and let 03 be the product 
of a finite set of "bad" primes. Recall that X is an approximation to 
\A\: 

n 

and that for q = q'q" square-free with q'\*B and (q", 23) = 1 we have 
\A\ = J2 a n = oj{q){X + X q ,)+r(q). 

n=0(q) 

It is assumed that for any divisor q'\*B, the factor X q > is a power less 
than the main term X. Also u is multiplicative away from 23, i.e. 
u>(q) = ou(q') Y[p\ q » u (p)- Let z > 2 be a parameter (a small power of 
X) and define 



P(z) = n 



p<z 

Assume z is large enough (by taking T large enough) so that 23|P(z). 
Consider a sum of the form 

S{z) := ^2 a n- 

{n,P{z))=l 

Notice that if z = T a , n < T and (n, P{z)) = 1 then n has at most 
1/a prime factors. In this way, S(z) counts the number of 1/a-almost 
primes. 
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By Mobius inversion, we have 

S{z) = Y ttn Y M 

n q\(n,P{z)) 

= Y kq) Y a « 

q\P(z) n=0(q) 

= Yl MHq)(X + X q/ )+r(q)) 

q\P(z) 

= Si + E2 + £3, 



with 



Ei = Y M?M?)* = x - ^(p)) x Y MsW) 

q\P(z) q'\<B 



^2=Y KqMq)x< = II^ 1 " w (p)) x ( ^taW)** I > 

and 



^3= Y m^m?)- 

q\P{z) 

Let 

S*{z) = Z 1 + Z 3 . 

Theorem A.l (See Iwaniec [Iwa96j . Theorem 3.7 on page 63). Let 
Q > e 2K where K is the constant appearing in pi) and define 



v(z) = n( 1 -^))' 



p<z 

R(Q) = 5>(g)| 

q<Q 

f(s) = 2ei log(s-l)/s, for S 6 [2,4], 
F(s) = 2e 7 /s ; for s G [1,3], and 

d = cir 11 (iogio g iogg) 3 (iogio g g)- 1 . 

Here 7 = .577... is the Euler constant and c = 3.591... solves 
(c/e) c = e. Then 

(f(s) - D)XV{z) - R(Q) < S*{z) < {F{s) + D)XV{z) + R(Q), 

where s = log Qj log z. 
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s-e 

The choice Q = T 2 ( 1+e ) gives 

D < (log log log T) 3 (log log T)- 1 -> 0, 

so we can take 

s = 2(l + e), (A.l) 

giving f(s)>D for T sufficiently large. 

s-e 

Then with z = Q 1/s = T 4 ^) 2 we have 

S*(z)~^~T s /logT^S(z), 

since S 2 <C X l ^ n . As e > is arbitrary, this captures i?-almost primes 
with any R > 4/(5 - 9). 

Remark A. 2. The set Zr is genuinely the set of integers having at 
most R prime factors, not just R prime factors outside of *B. 

Appendix B. Proof of Lemma 13.31 

We require the following simple lemma from the theory of inhomo- 
geneous ODEs, in particular the method of variation of parameters. 

Lemma B.l. Let A > and suppose f and g are functions satisfying 

a 2 

dy 2 

Assume A = s(l — s) 7^ 1/4. Then there exist constants a and f3 
such that 

f(y) = ay s + (3y l - s + u(y)y s + v(y)y 1 ~ s , 

where 

ry 

u ( y ) = (1 - 2s)" 1 / w- l ~ s g(w)dw (B.l) 

Jl/T 

and 

ry 

v(y) = (2s - l)" 1 / w s - 2 g(w)dw. 

Jl/T 

If\ = 1/4 then 

f{y) = ay 1/2 + (3y 1/2 logy + u(y)y 1/2 + v(y)y 1/2 log y, 

where 

ry 

u (y) = / w~ 3 ^ 2 \og{w)g{w)dw , and 

Jl/T 

ry 

v {y) = ~ w~ 3 / 2 g(w)dw. 

Jl/T 



-y 2 — 2 f(y)-V(y) = g(y)- 
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Proof. Elementary calculus. For details see |Kon07] . 



□ 



Fix any e L 2 (r\HI) and any A > 0. For simplicity assume A ^ 1/4 
(the calculation in the opposite case is similar). Consider the left hand 
side of (3.9) and unfold: 



F T (z)<f>(z)dz 



r\H 

oo 



l/T \J N 



(fi(z)dx 



dy 



y 



2 - 



where again N is an interval corresponding to the width of the cusp at 
infinity. 
Let 



so that 



Let g be defined by: 



f(y) 

{Ft ,0) 



4>(z)dx, 



N 



i/T y 



& 



g(y) :=-y'—f(y)-\f(y) 



(A - X)(f)(z)dx. 



(B.2) 



(B.3) 



N 



By Lemma B.l[ 



f(y) = ay s + (3y l ~ s + y s u(y) + y^'viy). 

The first two terms are the homogenous solution and the last two are 
the perturbation. Of course inserting the homogenous component of / 
into (B.2) we have the main term in (3.9) 



(ay 8 + /V~ s ) %=AT S + BT 1 -' 



'i/T y 

Thus it remains to show that I, II ^Ca,t — \)<f>\\, where 



y s u(y)-^, and II 



°° i_. f Jy 

i/T y 



<i/t y 

are the contributions from the perturbation. Integrate / by parts and 
recall from pi) that u(l/T) = 0: 

-l 



u{y) 



y 



l 



OO yS -l 



l/T 



-1-s 



9(y) 



1 - 2s 



dy. 



(B-4) 
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Modulo constants, insert (B.3) into the last integral of (B.4) and apply 
C auchy- S hwarz : 



l/T J N 



(A - X)(f)(z)\ fl 

y 



y 



l/T J N 



(A - \)ct>{z) 



dx dy 

1/2 



dx dy 



l/T J N 



2 \ V2 
dx dy 



« w ||(A-A)0|| (\N\T) 1/2 (B.5) 
< A)T>r ||(A-A)0||, 

where \N\ is the length of N. Here we used the fact that the box 

N x [l/T, oo]cH 

is contained in a union of finitely many (depending on T) fundamental 
domains for r\H. 

Remark B.2. The appearance of \N\ in ( B.5[ ) (via the use of Cauchy- 
Shwarz) is the most severe obstruction to removing the assumption 
that T stabilize infinity. If there is no cusp at infinity, then \N\ = oo 



and our analysis fails. See Remark 1.7 



For the first part of (B.4), we need the following bound: 



u(y) 



1 - 2s J 1 
1 



IT 



w s g(w)dw 



1 - 2s 



y r (A - \)<f>( x + iw) 

l/T J N 



W 



l/T J N 
V 



(A - \)<f){x + iw) 



x 



w 



\w s \ dxdw 



■ w s dx dw 

1/2 



dx dw 



'l/T J N 

« AiT ||(A-A)0|| 



1/2 



y l-2s + T 2s-l if s > 1/2 

logy + logT if 9te(s) = 1/2. 



So 



hm u(y)y s -^ XtT ||(A-A)0||. 



The integral II is handled identically and we are done. 
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Note added in proof: In joint work with Hee Oh |KO08aJ, we cir- 
cumvent the Main Identity to prove the Main Theorem without the 
assumption that is nontrivial. Instead of the Main Identity, we 
prove the equidistribution of long horocycle flows on the unit tangent 
bundle of an infinite- volume Riemann surface of constant negative cur- 
vature, and then use this equidistribution to count. Furthermore, we re- 
place the j3 sieve by the weighted sieve of Diamond-Halberstam-Richert 
[DHR88J, which gives better numbers under nearly identical hypothe- 
ses, and execute the sieve for various other choices of / in (1.1). We 
also use these methods in [KO08b] to count the number of circles in 
an Apollonian packing of bounded curvature, and discuss various Dio- 
phantine properties of integral Apollonian packings. 
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